Unsupervised single-shot depth estimation using perceptual reconstruction

نویسندگان

چکیده

Abstract Real-time estimation of actual object depth is an essential module for various autonomous system tasks such as 3D reconstruction, scene understanding and condition assessment. During the last decade machine learning, extensive deployment deep learning methods to computer vision has yielded approaches that succeed in achieving realistic synthesis out a simple RGB modality. Most these models are based on paired RGB-depth data and/or availability video sequences stereo images. However, lack pairs, sequences, or images makes challenging task needs be explored more detail. This study builds recent advances field generative neural networks order establish fully unsupervised single-shot estimation. Two generators RGB-to-depth depth-to-RGB transfer implemented simultaneously optimized using Wasserstein-1 distance, novel perceptual reconstruction term, hand-crafted image filters. We comprehensively evaluate custom-generated industrial surface set well Texas Face Recognition Database, CelebAMask-HQ database human portraits SURREAL dataset records body depth. For each evaluation dataset, proposed method shows significant increase accuracy compared state-of-the-art single-image methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised learning through one-shot image-based shape reconstruction

Objects are three-dimensional entities, but visual observations are largely 2D. Inferring 3D properties from individual 2D views is thus a generically useful skill that is critical to object perception. We ask the question: can we learn useful image representations by explicitly training a system to infer 3D shape from 2D views? The few prior attempts at single view 3D reconstruction all target...

متن کامل

Single-shot Doppler Velocity Estimation using double chirp pulse compression

Coherent ultrasonic Doppler velocimeters provide precise and accurate measurements of velocity profiles in many applications. However, these instruments suffer from the well known range-velocity ambiguity, making this kind of instruments not well suited for velocity measurements in channels of several meters depth and water velocities of some meters per second. On the other side, incoherent ult...

متن کامل

Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction

Despite learning based methods showing promising results in single view depth estimation and visual odometry, most existing approaches treat the tasks in a supervised manner. Recent approaches to single view depth estimation explore the possibility of learning without full supervision via minimizing photometric error. In this paper, we explore the use of stereo sequences for learning depth and ...

متن کامل

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue

A significant weakness of most current deep Convolutional Neural Networks is the need to train them using vast amounts of manually labelled data. In this work we propose a unsupervised framework to learn a deep convolutional neural network for single view depth prediction, without requiring a pre-training stage or annotated ground truth depths. We achieve this by training the network in a manne...

متن کامل

Depth Estimation for Mobile Robot Using Single Omnidirectional Camera System

Described here is a new method for depth estimation using a single omnidirectional visual sensor embedded on an autonomous mobile robot. This work is part of an on-going research project to study the visual guidance of autonomous robots. The method is based on a vertically aligned omnistereo configuration and laws of reflection applied on a geometric optics field. The proposed system yields a c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Machine Vision and Applications

سال: 2023

ISSN: ['1432-1769', '0932-8092']

DOI: https://doi.org/10.1007/s00138-023-01410-5